Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
1187 | 2 | 20,000 |
1198 | 2 | 4,000 |
1201 | 2 | 5,000 |
1202 | 2 | 50,000 |
1204 | 2 | 7,107 |
1930 | 1 | 1,494 |
1931 | 1 | 1,600m |
1933 | 1 | 10,000 |
1934 | 1 | 10,180,000 |
1935 | 1 | 10,454,400 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
1344 | 2 | Krais(t |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
496 | 6 | country's |
996 | 3 | b'long |
1274 | 2 | Earth's |
1303 | 2 | Grimm's |
1499 | 2 | Vasa's |
1916 | 2 | world's |
2135 | 1 | America's |
2161 | 1 | Asia's |
2175 | 1 | Ba'ath |
2178 | 1 | Baha'i |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
3189 | 1 | UTC+1 |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
2004 | 1 | 1997/1998 |
2043 | 1 | 337/1992 |
2328 | 1 | DOS/Windows |
3188 | 1 | UNU/Wider |
3352 | 1 | and/or |
3900 | 1 | http://www.eesti.ee |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have a very low frequency, this may indicate a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
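The same query can be repeated for every character in the list. The following is a minimal sketch of how that might look, not the procedure used to produce the tables here. It uses Python with an in-memory SQLite database as a stand-in for the real database and assumes a `words` table with the columns `w_id`, `freq` and `word` that appear in the query. The helper names (`like_pattern`, `words_containing`, `build_demo_db`) and the demo row `50%` are hypothetical. The `ESCAPE` clause is added because SQLite needs it, and `ORDER BY` is added only to make the output reproducible.

```python
import sqlite3

# Characters examined in this subsection, copied from the list in the text.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(char):
    """Return a LIKE pattern matching any word that contains `char`.

    The percent sign and the underscore are LIKE wildcards, and the
    backslash serves as the escape character, so these three characters
    are escaped when they are meant literally.
    """
    if char in ("%", "_", "\\"):
        char = "\\" + char
    return "%" + char + "%"


def words_containing(conn, char, limit=10, id_offset=100):
    """Return up to `limit` rows (rank, freq, word) for words containing `char`."""
    query = (
        "SELECT w_id - ?, freq, word FROM words "
        "WHERE w_id > ? AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    params = (id_offset, id_offset, like_pattern(char), limit)
    return conn.execute(query, params).fetchall()


def build_demo_db():
    """Create a tiny in-memory `words` table (the schema is an assumption)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, freq INTEGER, word TEXT)"
    )
    rows = [
        # Rows taken from the tables in this section, with w_id = rank + 100.
        (596, 6, "country's"),
        (1287, 2, "20,000"),
        (3289, 1, "UTC+1"),
        (3452, 1, "and/or"),
        # Hypothetical row, added only so that the % query finds something.
        (5000, 1, "50%"),
    ]
    conn.executemany("INSERT INTO words VALUES (?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    for char in SPECIAL_CHARS:
        for rank, freq, word in words_containing(conn, char):
            print(char, rank, freq, word)
```

Without the escaping in `like_pattern`, the pattern `%_%` would match every non-empty word, because `_` stands for any single character in a LIKE pattern.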